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Nasturtium Fatty Add Elongase (FAE) gene and its use In increasing emcic acid content 



Bacltgronnd to tlie invention 

Very long chain fatty acids (VLCFAs) with 20 carbons or more are widely distributed in 
nature. In plants they are mainly found in epicuticular waxes and in the seed oik of a number of 
plant species, including members of the Brassicaceae,Limnan(heceae, Simmondsia and 
Tropaeolaceae. A strategic goal in oilseed modification is to genetically manipulate high erucic 
acid (HEA) germplasm of the Brassicaceae to increase the content of erucic acid (22:1 A13) and 
other strategic VLCFAs in the seed oil for industrial niche market needs. Erucic acid and its 
derivatives are feedstocks in manufacturing slip-promoting agents, surfectants, plasticizers, nylon 
1313, and surface coatings and more than 1000 patents have been issued. The current market for 
higji arucate oils exceeds $120 million U.Syannura. Worldwide erucic acid demand is predicted to 
increase fixnn about 40 million pounds (M pds) in 1990 to about 80 M pds by the year 2010. 
Similarly, demand for the derivative, behenic acid, is predicted to triple to about 102 M pds by 
2010. In recent years, production has mcreased to meet maricet needs and high erucic acreage in 
westem Canada is currently at a record high. ABrassica cultivar conlaming erucic acid levels 
approaching 80% would significantly reduce the cost of producing erucic acid and its derivatives 
and could meet flie forecast demand for erucic and bdiauc acids as renewable, environmentally- 
Iriendiy industrial feedstocks. 

VLCFAs are syntfaesi^d outside the plastid by a membrane bound fetty acid elongation 
complex (elongase) using acyl-CoA substrates. The first reaction of elongation involves 
condensation of malonyl-CoA with a long chain substrate producmg a 3-ketoacyl-CoA. 
Subsequent reactions are reduction of 3-hydroxyacyl-CoA, dehydration to an enoyl-CoA, 
followed by a second reduction to form the elongated acyl-CoA. The 3-ketoacyl.CoA synthase 
(KCS) catalyzing the condensation reaction plays a key role in determining the chain length of 
fatty acid products found in seed oils and is the rate-limiting enzyme for seed VLCFA production. 
The composition of the fatty acyl-CoA pool available for elongation and the presence and size of 
the neutral lipid sink are additional important factora influencing the types and levels of VLCFAs 
made in particular cells. 

Our knowledge of die mechanism of elongation and properties of FAEl and other 
elongase condensing enzymes is, in part, limited by theu* membrane-bound nature: as such they 
are more difficult to isolate and characterize than soluble condensing enzymes. The genes 
encoding FAEl and its homologs have been cloned fiom Arabidopsis ihaliana and from Brassica 
napus (two homologous sequences, Bn-FAEl .1 and Bn-FAE 1 .2). 



Site-directed mutagenesis experiments have been carried out on QxeArabidopsis FAEl to 
decipher the importance of cysteine and histidine as residues in condensing enzyme catalysis. 
Results have shown that cysteine^ and four histidine residues are essential for the enzyme 
activity. 

In this \vork» we selected Tropaeolum majus, garden nasturtium, as a source of the 
elongase involved in VLCFA synthesis based on flie fact that this plant is capable of producing 
significant amounts of arucic acid (70-75 % of total fatty acicQ and accumulates trierucin as flie 
predominant triacylglycerol (TAG) in its seed oil. Here, we report the isolation of a nasturtium 
FAE gene and demonstrate the involvement of its encoded protein in the elongation of saturated 
and especially monounsaturated &tly adds. 

This invention relates to a nasturtium cDNA encoding an "elongase" (condensing 
enzyme) wifli a high specificify for eicosoioyl moieties wWch can be utilized to engineer seed oil 
crops for production of high erucic acid oils. 

There is interest in modifying Ifae seed oil fat^ acid composition and content of oilseeds 
by molecular genetic means to provide a dependable source of Super High &ucic Acid Rapeseed 
(SHEAR) oil for use as an industrial feedstock. 

Nonetheless, to date, increases in the content of some strategic fatty adds have beoi 
achieved by introduction of various fetty acid biosynthesis genes in oilseeds. Some examples 
include: 

Expression of a medium chain fatty acid thioesterase firom Califorraa Bay, in 
Brassicaceae to increase the lauric acid content (Calgene) 

Expression of an anti-sense construct to (he A9 desaturase in Brassicaceae to 
increase the stearic acid content (Calgene) 

Increased proportions of oleic acid by co-suppression using constructs encoding 
plant microsomal desaturases. (DuPont^Cargill) 

Expression of z Jojoba "e!ongase"3-keto-acyl-CoA synthase in low erucic acid 
(canola) B. napus cultivars to increase the level of crude acid: flie effect following expression in 
high erucic acid cultivars was negligible (Calgene, Lassneret al., 1996). 

However, there has not been an elongase gene identified or characterized as encoding an 
FAE with the ability to produce 22: 1 beyond the level aheady existing in HEARS, napus 
cultivars. 

We considered that the isolated FAE "elongase" homolog from Tropaeolum mqfus 
(garden nasturtium) with GenBank Accession No. AY082610 (published on March 6* 2002), 
could be used to engineer plants to produce seed oUs higjhly <mriched in crude add. We found Oiat 



to date, this is the first "elongase" transgcne experiment to result in a 5-Wold increase in the 
proportions erucic acid in plants. 

To our knowledge the nearest ait relates to an elongase gene (FAEl) from Arabidopsis 
\yiiich was cloned and published as: James, D.W. Jr., Lim, E, Keller, J., Plooy. L, RaIston,E. and 
Dooncr, H.K, (1995) Directed tagging of Ae Arabidopsis FATTY ACID ELONGATIONl 
(FAEI) gene with the maize transposon activator. The Plant Cell 7: 309-319 (1995). 

TTie reader is also referred to sequences 7, 9, 11,13, 15. 17, 19, 21, 23, 25, 27, 29. 35, 37. 
39, 41 from Jaworski, J.G. and Blacklock. BJ. Patent Application WOOl 94565 as well as 
sequences 19, 20, 21, 22, 23 from Kunst and Clemens, Regulation of embryonic transcription in 
plants. Patent Application WOOl 1 1061; 15-FEB.2001; University of British Columbia (CA). 

Summary of the Invention 

The invention relates to an expression vector for transformmg a plant cell, said expression 
vector comprising a g^e codmg for a nasturtium fat^ acid elongase gene in reading frame 
alignment wifli a promoter capable of increasing repression of said gene, ^en said transformed 
plant cell is m a seed, sufficient to increase in proportion of very long chain monounsaturated fat^ 
acid when compared with a control plant cell. The invention also relates to a plant cell comprising 
a heterologous gene coding for a nasturtium fatty acid elongase gene or allelic variant thereof, 
said plant cell being capable of iax>ducing an increase, preferably at least a 10% increase, in 
proportion of a very long chain monounsaturated &tly acid M^en compared witii a control plant 
cell lacking said heterologous gene. The increase can be larger, c,g. up to about five or sisg-fold. 
The invention also relates to seeds and plants comprismg such plant cells and the use of such 
vectors to produce such plant cells, seeds and plants. The plant preferably is a dicotyledon, 
especially a membw of the Brassicaceae, Limnanthaceae, Tropaeolaceae or Simmondsia. 

The fatty acid elongase (often designated FAE or 3-ketoacyl-CoA synthase (KCS)) is a 
condensing enzyme and is the first component of the elongation complex involved in synthesis of 
erucic acid (22: 1) in seeds of Tropaeolum majus (garden nasturtium). Usmg a degenerate primer 
approach a cDNA of a putative embryo FAE was obtained showing high homology to known 
plant elongases. This cDNA contains a 15 12-nucleotide open reading frame (ORF) that encodes a 
protein of S04 amino acids. A genomic clone of the nasturtium FAE was isolated and sequence 
analyses indicated the absence of introns. NorAem hybridization showed the expression of Ois 
nasturtium FAB gene to be restricted to the embryo. SouAem hybridization revealed the 
nasturtium 3-ketoacyl-CoA synthase to be encoded by a small mulitigcne family. To establish the 
function of the elongase homolog, the cDNA was introduced into two difTerent heterologous 
chromosomal backgrounds, Arabidopsis (A. ihaliana) and tobacco (Nicotlana tabacum), under the 



control of a seed-specific (napin) promoter and the tandem 35S promoter, respectively. Seed- 
specific expression resulted in up to a 6^old increase in erucic acid propoitions in Arobidopsis 
seed oil. Constitutive expression in transgenic tobacco tissue resulted in increased proportions of 
very long chain saturated fatty acids. These results indicate that the nasturtium FAE gene aicodes 
a condensing en^e mvolved in the biosynthesis of v»y-long-chain fatty acids, utiliang 
monounsaturated and saturated acyl substrates. It shows utility for directing or engineering 
increased synthesis of erucic acid in other plants* 
Brief description of the Figures 

Figure 1. Substrate specificity of elongase(s) from mid-developing nasturtium (r. majus) 
embryos. 200 wg of protein from a 15,000 x g particulate fi:action was used in ttie elongase assay. 
Reaction conditions were as described in Materials and Methods. Results represent the average of 
three replicates. For each [l-'^q-acyl-CoA substrate, the relative proportional distribution of 
radiolabeled fatty acid elongation pn>duct(s) is(aie) demarcated. 

Figure 2 A: Comparison of the amino acid sequences of the nasturtium FAE homolog 
(NasFAl^ accession no. AY0826190) with fetty acid elongasel (FAEl) and related 3-tetoa<yl. 
C6A qoiAases from other plaiit species. The alignment contains the sequences of the com 
(ZeaFAE), Umnanthes (LimFAE), jojoba (SimFAE), Arabidopis (AiaFAE) Brassica (BraFAE) 
and two Arabidopis 3-ketoacyl-CoA synthases associated with wax ^thesis (AraKCS, 
AraCUT). The GenBank Accession numbers for the sequences shown are AJ292770 (ZeaFAE), 
AF247I34 (LimFAE), U37088 (SimFAE). AF053345 (AraKCS), AF1295U (AraACUT), 
U29142 (AraPAE), AF009563 (BraFAE). ConsCTved cysteine and histidme residues are labeled 
with diamonds and triangles, respectively. Tyrosine at position 429 in the nasturtium FAE 
polypeptide is indicated by an asterisk. 

B: Dendrogram of the 3-ketoacyl-CoA synthase gene femily based on the amino acid 
sequences. The alignment was carried out by the Qustal W method using Lasergene analysis 
software (DNAStar, Madison, WI) GenBank accession numbers: AF247134 (LmiFAE), U37088 
(SimFAE), AY082610 (NasFAE), AJ292770 (ZeaFAE), AF053345 (AiaKCS), U29142 
(AraFAE), AF009563 (BraFAE), AF12951 1 (AraCUT). 

Figure 3. Hydropathy analysis of T. majus FAE. A: Hydropathy plot of FAE indicating 
the presence of several hydrophobic regions. B: Schematic i^resentation of flie putative 
transmembrane domains of 71 majus FAE amino-acid sequence as predicted by TMAP analysis 
(Persson and Argos 1994), Numbers shown in tiie boxes correspond to tiie residues of each 
domain in FAR 



Figure 4. Northern and Southon analyses of T. maJusFAE. 

A: Nortfiern analysis of FAE gene expression in T. majus. Total RNA was isolated from 
roots (RT), leaves (LF), petals (PL) and embryos (EO). B: Soufliem blot analysis of the FAE gene 
in r majus. Genomic DNA was digested with restriction enzymes: EcoSl (lane 1), Accl Qane 2), 
Ncol Oane 3) and Hindm Qane 4). 

Figure 5. A. Proportions of 20:1 All and 22:1 A13 in seed oils from non-transfomiedi4. 
thaliana ecotype Wassilewskija (WS-Con), two plasmid only transgenic control lines (RDt and 
RD-15), and the eighteen best A thaliana T2 transgenic lines «q)ressing the T. majus FAE gene 
under control of the napin promoter. B. Proportions of 18:0, 20:0, 22:0 and 24:0 in seed oils from 
non-transformed thaliana ecotype Wasilcwskija (WS-Con), two plasmid only transgenic 
control lines (RDl- and RD-15), and the eighteen hesXA. thaliana transgenic lines expressing 
the Z majus FAE gene under control of the napin promoter. The values are the average ± SD of 
three determinations performed on 200-seed lots. 

DetaOed Description of the Invention 

Example 1 

Plant materials 

All ^perimental lines propagated in the greenhouse wm grown at the Kristjanson 
Biotechnology Comply greenhousesi Saskatoon, under natural light conditions supplemented 
with hi^-pressure sodium lamps wiOi a 16 h photoperiod (16 h of light and 8 h of darlcness) at 
22"C and arelalive humidity of 25 to 30%. Trqpaeolum majus plants (cultivar Dwarf Chcny 
Rose) were grown m the greenhouse and flowers were hand-pollinated. Seeds at various stages of 
development were harvested, their seed-coats were removed and embiyos were frozen in liquid 
nitrogen and stored at -80^C. Tobacco plants were grown under sterile conditions on MS medium 
^urashige and Skoog, 1962) as well as under normal greenhouse conditions. Arahidopsis plants 
were grown m a growth chamber at 22^C with photoperiod of 16 h light (120 |iE-m*V) and 8 h 
dark. 



Nasturtium embryo protein preparations and elongase assays 

A 5-15,000 particulate fraction enriched in elongase activity was isolated essentially 
according to L6hden and Frentzen (1992). Briefly, embryos (2-3 grams) were ground in a mortar 
under liquid nitrogen and then 10 ml of IB buffer (80 mM HEPES pH 7.2, containing 2 mM DTT, 
320 mM sucrose and 5% PVPP) per g fresh weight was added. The homogenate was filtered 
through Miracloth and spun for 5 min at 5, 000 jc g in a Sorvall refrigerated centrifiige at 5 °C, the 
supernatant retained and re-centrifiiged at 15, 000 x g for 25 min. The resulting pellet was 
resuspended in 80 mM HEPES containing 20% glycerol and 2 mM DTT. The concentration of 
protein was determined hy the method of Bradford (1976). This subcellular fraction was either 
used directly to determme enzymatic activities or stmd at -80°C until used. 

The 1 5,000 x g particulate preparation was used to p^orm elongation assays as described 
by Taylor et al., (1992a & b) with the following modifications: The assay mixture consisted of 80 
mM HEPES-NaOH. pH 7.2 containing 0.75 mM ATP, 10 pM C6A-SH, 0.5 mM NADH, 0.5 mM 
NADPH, 2 mM MgOj, 200 nM malonyl-CoA, 18 pM [l-"q acyl-CoA (0.37 GBq taoY^) and 
nasturtium protein in a fbal volume of 500 pL. The reaction was started by the addition of 200 
pmg of protein and incubated in a shaking water-baft at 30"C, 100 rpm for 0.5 h. [1 -*^C> 
Radiolabeled acyl-CoAs were synthesized from die conresponding free fiitty acids as described 
previously by Taylor ct al., (1990). Elongase reaction assays were stopped with 3 mL of lOOgL ' 
KOH in methanol. Fatly acid mefiiyl esters (FAMEs) were prepared and quantified by radio- 
HPLC as described previously (Taylor et ah, 1992b). 

Lipid analyses 

The total &tty acid content and acyl composition of tobacco plant lipids mdArabidopsis 
seed oils was determmed by GC of flie FAMEs with 17:0 FAME as an internal standard as 
described previously (Zou et al., 1 997; Katavic et al., 2001; Taylor et al., 2001) 

Isolation of FAE cDNA by a degenerate primers approach 

Degenerate primers were designed for amino acid sequences consenred among 
Arabidopsis thaliana KCSl (AF053345). Brassica napus FAEI (AF009563), Limnanthes 
douglasii FAE (AF247134) and Simmondsia chinensis 7AE (U37088). Single^tranded cDNA 
template for reverse transcriptase-PCR was synthesized at 42*^0 from embryo poly (A) RNA wifli 
PowerScript™ (aontech). A 50 |iL PGR reaction contained smgle-stranded cDNA derived from 
40 ng of poly (A) RNA, 20 pM of each primer: F-forward 
TCT(An^GG(A/DGG(aA)ATGGGTTG [LGGMGC], F-reverse 

T(G/A)TA(T/C)GC(Cyr)A(A/G)CTC(A/G)TACC [WYELAY] and 2.5 U of Taq DNA 
Polymmse (Amersham) under standard conditions. An internal part of the elongase sequence was 



amplified in a thermolcycler during 30 cycles of Ae following program: 94*^C for 30 sec, 48^C for 
30 sec and 72"C for I min. The sequence of a 650-bp PGR product was used to design a primer to 
amplify the 5' and 3' ends of the cDNA using the SMART™ RACE cDNA Amplification Kit 
(CLONTECH). After assembly to determine the full length sequaice of the cDNA, the open 
reading firame (ORF) was amplified using the primers P- forward 
ACCATGTCAGGAACAAAAGC and PR-reverse TTAATTTAATGGAACCTCAACCG, and 
subsequently cloned into the pYES2 expression vector (Invitrogcn). 
cDNA library construction 

To construct the nasturtium developing cDNA library, immature seeds were collected 17 
d^s after pollination. Total RNA was extracted fit)m embryos according to Lindstrom and 
VodWn (1991), then poly (A) RNA was isolated using Dynabeads 01igo(d'I)25 (DYNAL). Copy 
DNA synthesis was performed on 1 jig of poly (A) RNA using SMART PCR cDNA Synthesis Kit 
(Clontech) according to manu&cturer*s protocol The cDNA population was dien subtracted with 
12S and 28 seed storage protein cDNA clones using PCR-Select cDNA Subtraction Kit 
(Qontech). The subtracted embryo cDNA population was cloned and then sequenced as described 
byJakoetaL(20Q2). 

Sequence handling 

Sequence analyses were performed using Lasergene software (DNAStar). Sequence 
similariQr searches and other analyses were performed using BLASTN, BLASTX (Altschul et al., 
1990) and PSORT (Nakai and Kanehisa, 1992) programs. 

Site directed mutagenesis ofFAE 

A site-directed mutagenesis ^cperiment was performed essentially as described previously 
(Katavic et al., 2002). The desired mutation (tyrosine at positim 429 is replaced with histidine) 
was introduced into the FAE coding region by polymerase chain reaction using primers Fl- 
forward TCGAGGATGTCGCTTCACCGATTTGGAAACAC and Rl-reverse 
GTTTCCAAATCGGTCAAGCGACATCCTCGATGG. Primers were complementary to the 
opposite strands of pYES2. l/V5-His-T0P0 containing the nasturtium FAE gene. 

Northern analysis 

Total RNA from nasturtium plant material was isolated according to Lindstrom and 
Vodldn (1991). 20 microgram of RNA was firactionated on a 1.4% formaldehyde-agarose gel and 
the gels were then stained with ethidium bromide to ensure that all lanes had been loaded equally 
(Sambrook et al., 1989). The RNA was subsequently transfenred to Hybond N* membrane and 
hybridized with the ^ labeled FAE DNA probe, prepared using the Random Primes DNA 
labeling kit (Gibco-BRL, Cleveland). Membranes were hybridized at 60^C ovemight 



Plant transformation vectors 

The coding regions of (he nasturtium FAE (natural and mutated versions named SF and 
SMF, respectively) were amplified by polymerase chain reaction with primers BFforwaiti: 
taggatccATGTCAGGAACAAAAGC (lower case indicates the restriction site for BamHI); and 
SR-reveree tagagctcTTAATTrAATGQAACXrrCAACC (lower case indicates the restriction site 
for Sacl enzyme) and subsequently cloned as a AwiHI and Sad ISragment behind the constitutive 
3SS promoter in binary vector pBI121 (CLONTECH). 

Hie coding region of the nasturtium FAE was cloned behind flie seed-specific napin 
promoter as follows: ABarnHL site was introduced in front of the start codon and behind the stop 
codon of FAE by PCk with primers BF (as above) and BR-reverse: 
taggatccTTAATTTAATGGAACCTCAACC Gower case indicates the restriction site forBamBJ). 
TTie B. napus napm promoter was cloned in HmdUVXbaL sites of flie pUC19 (Feraientas) and the 
nos terminator was mtroduced as an EcoKUBamm fragment. The resulting vector was named 
pDHL The napin promoter/nos terminator cassette was excised by HbidOI/EcdBl digestion and 
subsequently cloned into the respective sites of pRI>400 (Clontech) resulting in pVKl. Hie 
coding region of FAE was then cloned into the BamHI site of pVKl behind the napin promoter 
and the resulting vector was named NF. Sense orientation of the FAE coding region wiA respect 
to the promoter was confirmed by restriction analyses with Xbal. 

The final binary vectors (SF: 35S-FAE, SMF: SSS-Mutated FAE, and NF: napin:FAE) 
were electroporatcd into Agrobactenum tumefaciens cells strain QV3101 containing helper 
plasmid pMP90. Plasraid integrity was verified by DNA sequencing following its re-isolation 
fcomA. thumefaciens and transformation into£. colL 

Plant transformation and genetic analysis 

Tobacco (Nicotiana tabacum cv. Xanthi) was transformed using a leaf disc transformation 
procedure (Horsch et al., 1985), Shoots that rooted in the presence of 50 |ig/mL kanamycin were 
considered to be transgenic. Transgenic plants were transferred to soil and grown in the 
greenhouse. 

Arabidopis (A. ihaliana eco^ Wassilewskija) were transformed by vacuum infiltration 
according to the method of Clougji and Bent (1998). Transgenic plants were selected and analyzed 
essentially as described by Jako et al., (2001). 

Molecular analysis of transgenic plants 

DNA was isolated from M g of tobacco or 150 mg of Arabidopsis leaf material using a 
urea-phenol extraction method (Chen et al., 1992) with the following minor modification: 
Material was frozen in liquid nitrogen and tept at-80^C until used. Extraction was performed for 



15 mm at room temperature and 400 mM ammonium acetate, pH 52 was used for the first two 
precipitation steps. Stable integration of the napimFi4£:nos cassette into the genome of transgenic 
plants was checked by PGR amplification on genomic DNA with NN3 and NN4 primes as 
described by Katavic et al, (2001). 

Southern analyses were performed to further confirm and select those transformants 
containmg single or multiple copies of the inserted fragments. 15 microgram of tobacco or 1 
microgram of Arabidopsis genomic DNA was digested with the restriction enzyme5acl, and the 
resulting j&agments were separated on a 05% (w/v) agarose gel, transferred to Hybond nylon 
membrane (Amersham) via an alkali blotting protocol. A 1.5 Kbp probe containing the coding 
sequence otFAE was generated by polymerase chain reaction (PGR) using primers: Pl-forward 
ATGTCAGGAACAAAAGC and P2-reverse TAATrrAATGGAAGCTCAACCQ and 
subsequently radioactively labeled with^¥ as described above. Hybridization was performed at 
65"G. The filters wot washed once in Ix SSPE, 0.1% SDS for 15 min and in O.lx SSPE, 0.1% 
SDS for S-10 min at the temperature of iQ^dization. The blots were developed by exposure to 
X-OMAT-AR film (Kodak. Rochester, NY). 

To estimate the number ofFAE isoforms in the T. majus genome, 15 microgram of 
genomic DNA was digested with restriction enqrmes: JfcoRI, Accl, Ncol and Hindm, Blotting 
and hybridization conditions were essentially as above except Aat filters were washed at low 
stringency with Ix SSPE, 0.1% SDS for 15 min, autoradiographed and then washed subsequently 
with O.lx SSPE, 0.1% SDS, and re-exposed. 

Example 2 

Acyl Composition of T. majus cv Dwarf Cherry Rose 

TTie acyl composition of the TAG fraction of this cultivar was typical in thatit had highly 
enriched proportions of very long chain monounsaturated fatty acids (VLGMFAs), particularly 
22:1 (77.5%) and 20:1 (16.0%) with a trace of 24:1 (1.5%), and a low proportion of total C,8 fatty 
acids (2.5%), primarily 18:1 (2.4%). The predominant TAG species were trierucin followed by 
22:1/20:1/22:1 (Taylor etaL, 1992a). 

Example 3 

Substrate specificity of nasturtium embryo elongases in vUro 
Although there has been considerable debate regarding the Bicyl substrate for elongase 
activity in developing oilseeds, recent studies of developing seeds of A napus have revealed the 
presence of two types of elongation activity in vitro: an acyl-C6A-dependent activity, and an 
ATP-dependent activity which apparently utilizes an radogenous acyl primer, A 15,000 ;r g 
particulate fraction was isolated from nasturtium embiyos collected at mid-development (at 14-17 



days after pollination^ the stage which exhibited die highest enrichment in acyl-CoA-dependoit 
elongase activity. 

It has been shown that while ATP is necessary for acyl-CoA-dependent elongation in 
vitro, too high a concentration of ATP strongly inhibited elongase activity. In addition, elongase 
enzyme activity has been reported to be stimulated by the prraence of 10 ^ CoASH. In order to 
optimize reaction conditions, we assessed the roles of thrae two co-factors. Elongase activity was 
measured in vitro m the 15,000 x g particulate ftaction from nasturtium embryos under different 
ATT concentrations (0-5 mM)m the presence of 10 pM CoASH with 18 fiM l-[*'*C]-18:l-CoA 
and 200 malonyl-CoA. The highest activity was found at a concentration of 0 J5mM of ATP. 
Then, elongase activity was examined with range of [l-"C]-acyl-CoAs substrates at an ATP 
concentration of OJSmM in the presaice of 10 CoASH. 

Our results hidicate tiiat in a developing nasturtium embiyo particulate fraction, acyl- 
CoA-dependent elongases have the capacity to elongate a wide range of saturated (C16-C20) and 
monounsaturated (Cw and C^o) fatty acyl moieties (Fig. 1). Of the [1-** C]-Iabeled acyl-CoA 
series (16:0-CoA, 18:0-CqA, 18:1^A. 20:0-Q)A, 20:1-CoA. 22:1^oA), tested in vitro, 
elongase(s) from mid-developing nasturtium embryos exhibited tiie highest activity with 18:1- 
CoA and 20: l-CoA (102 and 95 pmol/min/mg protein, respectively). These elongase activity rates 
are of the same order of magnitude as that reported for acyl-CoA eIon^e(s) in a similar 
particulate fraction from developing rapeseed embryos. The particulate fraction was also able to 
elongate, in order of specificity, the saturated substrates 18:0-CoA, 16:0-CoA, and to a much 
lesser extent, 20:0-CoA. hi general, regardless of die H"CJacyl-CoA substrate supplied in vitro, 
the major labeled &tty acyl product was the C2 extension of its respective i^ursor (about 80- 
90%), with the next respective C4 extension product being present in proportions of about 10*20% 
(Figure 1). The one critical exception to this trend was flie production solely of radiolabeled erucic 
acid from its respective 1-[*'*C] eicosenoyl-CoA precursor. There was no detectable elongation of 
l-[*^C]-labeled 22:l-CoA to 24:1, even though the latter is found m trace amounts in nasturtium 
seed oil. 



Example 4 

Isolation of T. majus PAE homolog 

Based on sequence homology among plant fatty acid elongase genes, a fulUcngth clone 
was ampliQed by PGR using a degenerate primers approach and flie sequence submitted to the 
GenBank (accession number AY0826 10; Figure 2 (A)). The nucleotide sequence had an open 
reading frame of 1512 bp. Subsequently, 3 partial clones of about 0.6 W), r^resentative of the 
AY083610 FAE clone, were found among 2,800 ESTs surveyed (about 0.1% representation) from 
a nasturtium embryo subtracted cDNA library. • 

Alignment of the amino acid sequence of the nasturtium FAE with other plant condensing 
en^es revealed the presence of six conserved cysteine residues (Fig 2 A). Further sequence 
analysis showed that one out of the four conserved histidine residues suggested to be important for 
Arabidopsis FAEl activi^, was substituted with tyrosine in the T. majus FAE polypeptide. 

An analysis of the nucleotide sequence of the corresponding nasturtium FAE genomic 
clone revealed the absence of intron sequences. A similar absence of mtrons was observed in 
homologs from A. ihaltana FAEI, rapeseed CE7 and CE8 and high and low erucic lines of A 
oleracea, B. rapa, canola B. ncpus cv Westar and HEAR A napus cv Hero. 

The r. majus FAE cDNA encodes a pobpqptide of 504 amino acids ftat is most closely 
related to an FAE2 from roots of Zea mays (69 % amino acid identify) (Fig. 2 (B)). The T. majus 
FAB polypeptide also shared strong identify with FAEs from Idmnanthes douglasii (67%) and 
from seeds of jojoba {Simondsia chinensis) (63%). Homology of the nasturtium FAE to two 
Arabidopsis 3-ketoacyl-CoA synthases AraKCS and AraCUTl) involved in cuticular wax 
synthesis was on the level of 57% and 53%, respectively. These homologs all exhibit the 
capabilify to elongate saturated fatfy acids to produce saturated VLCFAs. The FAEl polypeptides 
involved in the synthesis of VLCFAs in Arabidopsis and Brassica seeds showed approximately 
52-54% identify with the T, majus FAE. The nasturtium FAE protein was predicted to have a 
theoretical pi value of 93 using the algorithm of Bjeliqvist et al, (1993 and 1994) and a 
molecular mass of 56.8 kDA, which are similar to tiie respective values reported for the B, napus 
CE7 and CE8 FAE homologs as well as those from B. rapa (campesiris) and A oleracea, 

A hydropaAy analysis (Kyte-Doolitlle) of the amino acid sequoice of the T. majusFAE 
revealed several hydrophobic domains (Fig. 3A). Protdn analyses with the TMAP algoriflun 
(Person and Argos, 1994) predicted twoAT-terminal transmembrane domains, the first 
corresponding to amino-acid residues 35-55, and the second spannmg residues 68-88 (Fig 3B). 
Like other elongase condensmg enzymes, the T. majus FAE lacks iV-terminal signal sequences 
typically found for plastidial or mitochondrial-targeted plant enzymes. It also lacks a KXKXX or 



KKXX motif (X«any amino acid) often found at the Ctenninus of protdns retained within ER 
membranes. Rather, it is a type ma protem, typically present on endoplasmic reticular 
membranes. 

£xample 5 

Tissue specific expression and copy number estimate of T. majus FAE 

Northern blot analyses were performed to investigate the expression profile of &o FAE 
gene. Total RNA was isolated from different nasturtium tissues includmg roots, leaves^ floral 
petals and mid-developing embryos. A strong hybridization signal withFi^-specific probe was 
observed only with RNA isolated from developing embryos (Fig. 4. A). 

A Southern blot hybridization was performed wifii nasturtium genomic DNA digested 
with several restriction enzymes including £^oRI, Acd, Ncdl and HimBR. The FAE gene has no 
intmial EcoBI^ AcA or Ncol sites, while one internal HindSL site exists. Autoradiography 
revraled the presence of one strongly-hybridizing fragment in all cases except with HindSO. for 
which two strongly hybridizing fragments were evident 0?ig. AS). In addition a minimum of 4 
weakly hybridizing fragments were detected After washmg under high stringency conditions, the 
numbo- of hybridizmg fragments was unchanged. Thus, we have concluded that T. majus FAE 
belongs to a multigenic family consisting of 4 to 6 members. A similar multigenic femily has been 
found for a nq[ieseed FAEl gene member. 

Example 6 

Heterologous expression of (he T majus FAE in Yeast 

To study the function of the protein encoded by the 7*. mtgus FAE, Oie coding region was 
linked to the galactose-inducible GALl promoter in tiie expression vector pYES2 and transformed 
into yeast Transgenic yeast cells harbouring the T. majus FAE did not show any difference in 
fatty acid composition in comparison to yeast cells transformed with empty vector. A similar 
difficulty with expression of Limnanthes FAE in yeast cells has been reported. 

As indicated earlier, a comparison of the predicted amino acid sequence of the nasturtium 
FAE widi other plant condensing enzymes (Fig 2 A) showed that one of the four conserved 
histidine residues, known suggested to be important for Arabidopsis FAEl activity, was 
substituted with tyrosine in the r. majus FAE polypeptide. To study the importance of this 
histidine residue for enqone activity, we used a site directed mutagenesis approach to replace the 
tyrosine 429 residue with histidine. This mutated version of nasturtium FAE was expressed in 
yeast cells. Analyses of fatty acid composition of transfonned yeast cells showed fliat histidine at 
position 429 did not restore enzyme activity. Therefore we decided to study the function of 71 
majus FAE in plant heterologous chromosomal backgrounds. 



Example 7 

Expression of majus FAE in tobacco plants 

To establish fimctional identity, the cDNA for the FAE-related polypeptides vm 
constitutively expressed in tobacco plants imder the control of the tandem 35S constitutive 
promoter. In addition, to assess the importance of histidine residues for emymc activity, the 
tyrosbe at position 429 in the nasturtium FAE was replaced with histidine and subsequently used 
to prepare a plant transformation vector under the control of the tandem 35S promoter. Integration 
of the 3SS/Fi4£yNos expression cassette into tobacco plants was confirmed by PCR amplification 
on gnomic DNA. Fatty acid composition was determined m callus, leaves and seeds of 
transgenic tobacco plants. 

Ccmstitutive eiqiression of tiie nasturtium FAE homologue in tobacco <^iis resulted in an 
increase in {Hoportions of VLCFAs fiom 3.7 % in the wild type background to as Ugh as 8.6% (a 
132% increase) in transgoiic lines (Table I). In particular, the increase in pnq)ortions of saturated 
VLCFAs (22:0, 24:0, 26:0) was most pronounced. The fact ftat the synthesis of fee saturated 
VLCFAs occurs at the expense of 16:0 and 18:0 suggests tiiat die nasturtium FAE is able to 
elongate C|6 and C|8 acids. Expression of die mutated version of die nasturtium FAE (SMF) 
resulted in a slight increase In die VLCFA content in tobacco callus, on average 1 8.5% in 
comparison to die wild type background. Increased prcq[)ortions of VLCFAs at die expense of 
LCFAs was observed in leaves of transgenic tobacco plants carrying eidi^ the nasturtium FAE or 
its mutated form (Table II). Comparison of fatty acid composition in tobacco tissues upon 
expression of the nasturtium FAE and its mutated version, revealed that tyrosine at position 429 is 
likely important to achieve full activity of the enzyme. A decreased proportion of 18:3 in leaves of 
transgenic tobacco lines in comparison to the wild type (empty vector) background suggests that 
the **metabolic pull" of die elongation padiway may be somewhat stronger than that of die 
competing desaturation pathway. 

Expression of nasturtium FAE in tobacco seeds resulted in a 50% increase in proportions 
of VLCFAs from 0.6% ui die wild type background to 0.9% m transgenic plants (data not shown). 
The relativefy low proportions of VLCFAs in tobacco leaves and callus (sec Tables I and II) may 
be an indication diat (i) in vivo, saturated fot^ acids are not present at high concentrations; 
dierefore even a 50% increase in relative proportions does not result in high levels of VLC 
saturated &tty acids accumulatuig in glycerolipids; (ii) expression of die nasturtium FAB when 
under the control of die 3SS promoter is relatively weak. 



Example 8 

Expression of T. majus FAEiaArabidopsis seeds 

Since expression of nasturtium FAE under the control of flie 35S promoter did not result 
in a high accumulation of VLCFA in tobacco seeds we decided to study the effect of expressing it 
in Arabidopsis. Using a vacuum-infiltration melhod, 18 kanamycin resistant Arabidopsis plants 
were obtained. The fatty acid composition of T2 seeds was determined. A significant increase was 
observed only in the content of erucic acid (22:1 cl3). On average, the level of erucic acid 
increased up to 3.2% (a 50% increase) m transgenic seeds comparing to 2. 1% in wild type 
background (data not shown). In tiie best transgenic lines, the contoit of erucic acid increased up 
to 4.0% (a 90% increase). 

Since tandem 3SS-driven constitutive FAE expression did not result in a strong increase in 
VLCFA proportions in tobacco and Arabidopsis seeds, we decided to use the seed-specific 
promoter napin to stud/ FAE expression in an Arabidopsis seed background. Frcnn vacuum- 
infiltration experiments, 25 kanamycin^resistant Ti plants were selected. The Taprogeny were 
collected individually fiom each plant and the fitty acid composition determined. Significant 
changes in fatty acid composition in comparison to the wild ^e (empty vector) were found. On 
average, flie proportion of erucic acid (22: 1 A13) increased fiom 2.1% in wild type to 9.6% in T2 
transgenic seeds at the expense of 20:1 Al 1 (Table IIQ. Eighteen of tiie best transgenic lines were 
selected to examine the range of VLCFA proportional re-distribution induced by expression of the 
nasturtium FAE gene (Figure 5A and B). Tlie cnicic acid content was mcreased by iq> to 6.5-fold 
in line NF-8. Small increases in the proportions of 24:1 Acl5 were also observed (Table HQ. 
There was also a relatively significant increase in the proportions of the saturated VLCFAs, 22:0 
and 24:0, at the expense of 18:0 and 20:0. In both the case of the VLC mono-unsatuiated fitter 
acids (Fig 5A) and the VLC saturated fatty acids (Fig SB), the highest proportional increases in 
erucic and in [behenic + lignoceric] acids were generally correlated with the concomitant 
reduction in the proportion of their corresponding elongase jmmers, eicosenoic and [stearate + 
arachidic] acids, respectively. 

Therefore, we conclude that the nasturtium FAE is able to preferentially elongate 20:1 
and [18:0 + 20:0]. As would be expected, th^e was significant variation m the proportions of 
22: 1 which accumulated OFigure 5 A) possibly due to positional efifects fi^om nasturtium FAE 
transgene tnsotion at different sites in the Arabidopsis genome. Similar variations wm observed 
in the expression of a castor fetty acid hydroxylase gene iCFAH12) in flie Arabidopsis fad2/fael 
mutant 

In summary, we have isolated a cDNA clone firom nasturtium which exhibits high 



similarity to the sequences of 3*etoa0yl-CoA synthases ftom various plant species but has the 
unexpected benefit of increasing the erucic acid content by 6-fold. 

Our in vitro jGndings suggest that the FAB proteins in a 15,000;c g nasturtium particulate 
fraction have a broad acyl-CoA preference, with the ability to elongate both monounsaturated and 
saturated Cib-CoA and C20-C0A substrates. In like manna-, a partially purified jojoba FAEl 
showed maximal activity with monounsaturated and saturated Cia and C20-C0A5 in vitro. 
However, it is important to note that the particulate elongation activity rqjorted in tiie current 
study most likely represents flie cumulative effect of expression of more than one member of this 
small gene £unily. Thus, from this expmment one can only conclude that the capacity to elongate 
bofli monounsaturated and saturated acyl moieties is represented in this nasturtium particulate 
fraction. 

While genetic analyses and homology assessments might predict that fte isolated 
nasturtium FAE gene migjht encode an enzyme which prefers to elongate saturated acyl*CoAs, the 
transgenic experiments in tobacco callus, tobacco leaves and in Arabidopsis seed, collectively 
confirmed that Ae heterologously-expressed 71 majus FAB can elongate botti monounsaturated 
and saturated acyl moieties. In &ct, in a transgenic Arabidopsis bacl^und, the nasturtium FAE 
was much stronger than the jojoba 3-KCS in its ability to increase the level of 22:1: Litroducing 
the jojoba cDNA into Arabidopsis resulted in an increase m 22:1 prqjortions ftom about 2% in 
the control to 4% in the transgenics. In comparison, when we introduced flie T. majus FAE into 
Arabidopsis, the erucic acid content increased by almost an order of magnitude (frfold) at the 
concomitant expense of 20:1 All, The acyl compositionof the transgenic ilraWrfo/w/s seed oil 
was reproportioned such that erucic and eicosenoic became about equal as the two predominant 
VLCFAs. 

The ability of the nasturtium FAE protein to preferentially elongate 18:I-CoA and 
especially 20:l-CoA, is consistent the observed acyl composition of nasturtium seed oil which 
consists primarily of very long chain- and specifically erucoyl moieties. We postulate therefore, 
that whether the nasturtium FAE transgene results in predominantly mono-unsaturated (20: 1 Al I, 
22:1 A13) or saturated (e.g. 20:0, 22:0) VLCFAs is more a function of the composition of the 
acyl-CoA pool (18:1 A9 and 20:1 Al 1 or 18:0 and 20:0 or, respectively) available to the 
condensing enzyme in the host species/target orpn. 

Thus, the nasturtium FAE homolog described herein, will have a larger engineering 
impact when strongly expressed in a seed-specific manner in E£A. Brassicaceae (e.g. A napus; 
B. carinata) wherein 18:1 A9 [and 18:2/18:3] and 20:1 All represent a potential acyl-CoA 
elongation substrate pool of almost 40% over and above the existmg 45% 22:1 A13 content 



Clearly, the current studies indicate that the nasturtium FAE e3q)ression should be combined wifli 
other genetic modifications we have made to enhance the VLCFA content of HEAR Brassicaceae 
and the proportions of erucic acid in particular, to provide an industrial feedstock oil of high value 
and broad applicability. 

A major goal of our research is to obtain, by genetic manipulation, Brassica napusL. 
cultivars with higher levels of «rucic acid (22: 1) in their seed oil ttian in present Canadian HEA 
cultivars. We would like to develop a 5. napuscyMvsr containing erucic acid levels above 66 
mol%, ideally wifli more than 80% erucic acid in the seed oil. To achieve our goals we are 
isolating new, more efficient strategic genes for high erucic acid and preferably, trierucin, 
production. We selected Tropaeolum majus, garden nasturthmi. as a source of those genes based 
on the fict that ttiis plant is capable of producing significant amoimts of erucic acid (7&7S % of 
total fetty acid) and accumulates trierucin as the predominant TAG in its seed oil. The fatty acid 
elongase (FAE), 3-fcetoacyl-CoA synthase (KCS) is the first component of tiie elongation complex 
involved in synthesis of erucic add (22:1) in seeds of Tropaeolum wq/iis fearfen nasturtium). 
Using a degenerate primm approach, a cDNA of an embiyo FAE was obtained and 
heterolpgously expressed in two differentplant backgrounds (^^ ikalianamdK tetecKBi) under 
the control of a seed-specific (napm) promoter and the constitutive (tandem 35S) promoter, 
respectively. Seed-specific expression resulted in up to a 6-fold increase in erucic acid proportions 
in Arabidopsis seed oil Constitutive expression in transgenic tobacco tissue resulted in increased 
proportions of very long ctein saturated fatty acids. These results indicate that the nasturtium FAE 
gene encodes a condensing enzyme involved in the biosynthesis of very-long-chain fatty acids, 
utilizing monounsaturated and saturated acyl substrates. Thus, the nasturtium FAE homolog will 
have a larger engineering impact when strongly expressed in a seed-specific manner in H.Ej\. 
Brassicaceae (e.g. B, napus) wherein 18:1 A9 [and 18:2/18:3] and 20:1 Al 1 represent a potential 
acyl-CoA elongation substrate pool of almost 40% over and above the existing 45% 22:1 A13 
content 

In addition, heterologous expression of the nasturtium FAE gene in HEAR Brassicaceae 
can be combined with other genetic modifications we have made to enhance the VLCFA content 
of HEAR germplasm (Katavic et al, 2001; Taylor et al., 2001) and the proportions of erucic acid 
in particular, to provide an industrial feedstock oil of high value and broad qiplicability. 

Expression of nasturtium FAE in Arabidopsis seedsresulted in a 6-fold increase in erucic 
acid content Therefore, it is anticipated that the introduction of this gene alone, or m combinatiOT 
with other altered gene expression phenotypes (e.g. FAEl and/or FAD2 and/or FADS) into HEAR 
Brassicaceae will result in transgenic lines with improved proportions of erucic add in the seed 



oil. 

Example 9 

Heterologous Expression of the nasturtium FAE In HEAR Brasslcaceae 

Hie nasturtium FAE gene under the control of the strong seed-specific promoter napin, 
has been introduced into HEAR Brassicaceae (e.g. 5. napus ; A carinata). Considering the 
results obtained in Arabidopsis seeds, it is anticipated that thwe will be a strong increase in the 
proportion of 22: 1 and saturated VLCFAs as well (by up to 10%). 

Example 10 

Heterologous expression in HEAR Brassicaceae (e.g. B. napus or B. carinata) co- 
transformed vrith the nz^iniAihalFAEH mpiatNasiFAE or crosses of Individual A thai 
FAE transgenic lines with nasturflum FAE transgenic Ones. 

Expression of nasturtium FAE in HEAR Brassicac&ie (e.g. B. napus; B. carinata) and 
the resulting proportional increase in erucic acid can be maxunized by also addressing the fact that 
20: 1, the prefeired monounsaturated substrate, is present in wild type seeds in relatively low 
proportions (5.5-6.5%). Therefore, for example, one can introduce fhc Arabidopsis FAE J and 
nasturtium FAE into HEAR Brassicaceae (e.g. B. napus; B. carinata). The first gene product 
should enhance conversion of 18:1 to 20:1 (Katavic etal., 2001), while the nasturtium gene 
product cleariy prefers to extend 20:1 to 22: 1. In this manner, tiie maximal proportion of emcic 
acid is expected. To achieve this goal, one could zpply a co-transformation method: The 
Arabidopsis FAE is cloned m a derivative of vector pRD400 which allows selection on 
kanamycin. while the nasturtium FAE is cloned in pCAMBIA vector which allows selection on 
hygromycin. Alternatively, individual transgenic lines homologous for the insertion of ii. thaliana 
FAEl could be crossed with nasturtium FAE transgenic lines. 
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Claims 

1 . An expression vector for transforming a plant cell, said expression vector comprising a 
gene coding for a nasturtium fatty acid elongase gene in reading frame alignment with a promoter 
capable of increasing expression of said gene, when said transformed plant cell is in a seed, 
sufficient to increase in proportion of vciy long chain monounsaturated fatty acid when compared 
with a control plant cell 

2. A plant cell comprising a heterologous gene coding for a nasturtium fatty acid elongase 
gene or allelic variant thereof, said plant cell being capable of producing an increase in proportion 
of a v^ long chain monounsaturated &tty acid whoi compared with a control plant cell lacking 
said heterologous gene. 

3. A seed comprising a plurality of plant cells according to claim 2. 

4. A plant comprismg a plurality of plant cells according to claim 2. 

5. A plant cell according to claim 2 wherein said heterologous gene codes for a 3* 
ketoacyl-CoA synthase. 

6. A plant cell according to claim 2 whmin said very long chain monounsaturated fatty 
acid comprises erucic acid. 

7. A plant accordmg to claim 4 wherein said plant is a dicotyledon. 

8. A plant according to claim 4 wherein said plant is a member of the Brassicaceae. 

9. A plant according to claim 4 wherein said plant is a member of fhsLimnanthaceae or 
Tropaeolaceae or Simmondsia 

10. A plant according to claim 8 whmm said plant is of Ae Brassica genus. 

1 1 . A method for altering erucic acid content of a plant-derived oil which method 
comprises cultivating a plant according to claim 4 and Aen extracting a plant^erived oil 
therefrom i^ch oil has altered erucic acid conteit 



12. Use of nasturtium fetty acid elongase gene for altering erucic add content in a plant 



Abstract 



This invention relates to seeds of plant, plants themselves and cells of such plants which 
comprise a heterologous gene coding for a nasturtium fatty acid elongase gene or allelic variant 
thereof, said plant or seed being capable of producing an increase in proportion of a very long 
chain monounsaturated fatty acid when compared with a control plant or seed lacking said 
heterologous gene. 
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Rgure 2 B. Dendrogram of the ketoacyl-CoA synthase gene family based on the amino acid sequences 

The ^gnment was carried out by the Clustal W method usmg Lasergene analysis software (DNAStar, Madison WI) 

Jr^rSwS^^^ TT^t (NasFAE), AJ292770 (Ze'aFAi), 

AF053345 (AraKCS). U29142 (AraFAE), AF009563 (BraFAE), AF129511 (AiaACUT) 
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Figure 3. Hydropathy analysis of T.majus FAE. (A) Hydropathy plot of FAE indicating the 
presence of several hydrophobic regions. (B) Schematic representation of the putative 
transmembrane domains of T.majt45 FAE amino-acid sequmce as predicted by TMAP 
analysis [Persson, Aigos 1994J. Numbers shown in the boxes correspond to the residues 
ofeadi domain m FAE. ' 
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Figure 4. Northern and Southern analyses of 7.»w/ttr/W£. 

^ m l^f ^'"^ expression in T.majus. Total RNA was isolated fiom roots (RT) 

leaves (LF), petals (PL) and embryos (EO). ^ ^' 

B. Southern blot analysis of the FAE gene in T.majus. Genomic DNA was digested with restriction 
enzymes: EcoBl Oane 1 ). Acd (lane 2), M»I (lane 3) and HindBl (lane 4) 



SEQUENCE LISTING 
<110> National Research Council of Canada 

<120> Nasturtium Fatty Acid Elongase (FAE) gene and its use in inreasing erucic 
acid content 

<130> Pat 989P-2 

<140> 

<141> 

<160> 15 

<170> Patent In version 3.1 

<210> 1 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> F- forward primer 
<400> 1 

tctwggwggm atgggttg 

<210> 2 

<211> 6 

<212> PRT 

<213> Artificial 

<220> 

<223> Coded by F-forward primer 
<400> 2 

Leu Gly Gly Met Gly Cys 
1 5 

<210> 3 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> F-reverse primer 



<400> 3 

tdtaygcyar ctcrtacc 



18 



<210> 4 
<211> 6 



<212> PRT 

<213> Artificial 



<220> 

<223> Coded by F-reverse primer 
<400> 4 

Trp Tyr Glu Leu Ala Tyr 
1 5 • 



.<210> 5 

<2H> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> P-forward primer 
<400> 5 

accatgtcag gaacaaaagc 

<210> 6 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> PR-reverse primer 
<400> 6 

ttaatttaat ggaacctcaa ccg 

<210> 7 

<211> 32 

<212> DNA 

<213> Artificial 

<220> 

<223> Fl-forward primer 
<400> 7 

tcgaggatgt cgcttcaccg atttggaaac ac 

<210> 8 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Rl-reverse primer 
<400> 8 



gtttccaaat cggtgaagcg acatcctcga tgg 



<210> 9 

<211> 25 

<212> DNA 

<213> Artificial 

<220> 

<223> BF-forward primer 
<400> 9 

taggatccat gtcaggaaca aaagc 

<210> 10 

<211> 30 

<212> DMA 

<213> Artificial 

<220> 

<223> SR-reverse primer 
<400> 10 

tagagctctt aatttaatgg aacctcaacc 

<210> 11 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> BR-reverse primer 
<400> 11 

taggatcctt aatttaatgg aacctcaacc 

<210> 12 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Pl-forward primer 

<400> 12 

atgtcaggaa caaaagc 

<210> 13 

<211> 22 

<212> DNA 

<213> Artificial 

<220> 



<223> P2-reverse primer 
<400> 13 

taatttaatg gaacctcaac eg 



<210> 14 
<2H> 503 
<212> PRT 

<213> Tropaeolum majus 
<400> 14 

Met Ser Gly Thr Lys Ala Thr Ser Val Ser Val Pro Leu Pro Asp Phe 
15 10 15 

Lys Gin Ser Val Asn Leu lys Tyr Val Lys Leu Gly Tyr His Tyr Ser 
20 25 30 

He Thr His Ala Met Tyr Leu Phe Leu Thr Pro Leu Leu Leu He Met 
35 40 45 

Ser Ala Gin He Ser Thr Phe Ser He Gin Asp Phe His His Leu Tyr 
^" 55 60 

Asn His Leu He Leu His Asn Leu Ser Ser Leu He Leu Cys He Ala 
'0 75 80 

Leu Leu Leu Phe Val Leu Thr Leu Tyr Phe Leu Thr Arg Pro Thr Pro 

85 90 gg 

Val Tyr Leu Leu Asn Phe Ser Cys Tyr Lys Pro Asp Ala He His Lys 
"'0 105 no ' 

cys Asp Arg Arg Arg Phe Met Asp Thr He Arg Gly Met Gly Thr Tyr 

120 125 

Thr Glu Glu Asn He Glu Phe Gin Arg Lys Val Leu Glu Arg Ser Gly 

135 140 

He Gly Glu Ser Ser Tyr Leu Pro Pro Thr Val Phe Lys He Pro Pro 
150 155 160 

Arg Val Tyr Asp Ala Glu Glu Arg Ala Glu Ala Glu Met Leu Met Phe 
1" 170 175 



61y Ala Val Asp Gly Leu Phe Glu Lys lie Ser Val Lys Pro Asn Gin 



190 



He Gly Val Leu Val Val Asn Cys Gly Leu Phe Asn Pro He Pro 



195 



200 



Ser 



205 



Leu Ser Ser Met lie Val Asn Arg Tyr Lys Met Arg Gly Asn Val Phe 



215 220 
Ser Tyr Asn Leu Gly Gly Met Gly Cys Ser Ala Gly Val He Ser He 



230 



235 



240 



Asp Leu Ala Lys Asp* Leu Leu Gin Val Arg Pro Asn Ser Tyr Ala Leu 
245 250 255 

Val Val Ser Leu Glu Cys He Ser Lys Asn Leu Tyr Leu Gly Glu Gin 
260 265 270 

Arg Ser Met Leu Val Ser Asn Cys Leu Phe Arg Met Gly Gly Ala Ala 

280 285 



He Leu Leu Ser Asn Lys Met Ser Asp Arg Trp Arg Ser Lys Tyr Arg 



295 



300 



Leu Val His Thr Val Arg Thr His Lys Gly Thr Glu Asp Asn Cys Phe 



310 



315 



320 



Ser Cys Val Thr Arg Lys Glu Asp Ser Asp Gly Lys He Gly He Ser 
325 330 

Leu Ser Lys Asn Leu Met Ala Val Ala Gly Asp Ala Leu Lys Thr Asn 
340 345 350 

He Thr Thr Leu Gly Pro Leu Val Leu Pro Met Ser Glu Gin Leu Leu 

360 365 

Phe Phe Ala Thr Leu Val Gly Lys Lys Val Phe Lys Met Lys Leu Gin 



375 



380 



Pro Tyr He Pro Asp Phe Lys Leu Ala Phe Glu His Phe Cys He His 



390 



395 400 
Ala Gly Gly Arg Ala Val Leu Asp Glu Leu Glu Lys Asn Leu Lys Leu 



405 



410 



415 



Ser Ser Trp His Met Glu Pro Ser Arg Met Ser Leu Tyr Arg Phe Gly 
420 425 430 

Asn Thr Ser Ser Ser Ser Leu Trp Tyr Glu Leu Ala Tyr Ser Glu Ala 
435 440 445 

Lys Gly Arg He Lys Lys Gly Asp Arg Val Trp Gin He Ala Phe Gly 
450 455 460 

Ser Gly Phe Lys Cys Asn Ser Ala Val Trp Lys Ala Leu Arg Asn Val 
465 470 475 480 

Asn Pro Ala Glu Glu Lys Asn Pro Trp Met Asp Glu He His Leu Phe 



Pro Val Glu Val Pro Leu Asn 
500 

<210> 15 
<211> 1765 
<212> DNA 

<213> Tropaeolum ma jus 
<400> 15 

agtttttttt gttgagaata accatgtcag gaacaaaagc aacatcagtt tctgttccac 60 
tgcctgattt caagcaatca gttaatctaa aatatgttaa acttggttat cattactcga 120 
tcactcatgc aatgtatctt tttctaaccc ctcttcttct cataatgtct gctcaaatct 180 
caactttctc tattcaagat tttcaccatc tttataacca tcttatcctc cacaatctct 240 
catcccttat cctatgcatc gctctcctcc tcttcgtctt aaccctctat ttccttactc 300 
gtcccacgcc tgtttattta ctc^acttct cttgttacaa accggatgct attcacaaat 360 
gcgaccgccg tcgtttcatg gacaccattc gtggaatggg tacttatacg gaagagaaca 420 
tcgagtttca aaggaaagtt ctagaaaggt ccggaatagg ggaatcgtct tatcttcctc 480 
cgactgtgtt taaaattcct cctagggttt acgatgcgga ggaacgcgcg gaggctgaga 540 
tgctgatgtt cggtgcggtt gatgggcttt tcgagaaaat atctgttaaa ccgaatcaaa 600 
tcggggtttt ggttgtgaat tgtgggttgt ttaatccgat accgtcttta tcttccatga 660 
ttgtgaatcg ctacaagatg agagggaatg tttttagtta taatttgggt ggaatgggtt 720 



gtagtgcggg tgtgatttcg attgatcttg ctaaagatct tcttcaggtt cgtcccaact 780 

catatgcttt ggtggttagt ttggaatgta tctcgaagaa cttgtatctc ggtgaacaaa 840 

gatcgatgct tgtttccaac tgtttgtttc gaatgggtgg ggcggcgatt ttgctttcga 900 

ataaaatgtc ggatcgatgg agatcaaagt atagattggt tcatacggtt cgaacccaca 960 

agggtaccga ggataactgc ttttcttgcg taactagaaa ggaagactcg gacgggaaga 1020 

tcggtatttc tttatcgaag aacctaatgg ctgttgccgg agacgcattg aagactaata 1080 

tcacaaccct cggaccactt gttctaccca tgtcggaaca attactcttc ttcgctactt 1140 

tggtcggaaa aaaggttttc aagatgaagc tacagccgta tataccggat ttcaagttgg 1200 

ctttcgagca tttctgtatt catgcaggtg gaagagctgt tctggatgaa ttggagaaga 1260 

acttgaagct ttcgagttgg catatggaac catcgaggat gtcgctttac cgatttggaa 1320 

acacgtcgag tagttcgctt tggtacgagt tggcttattc ggaggcgaaa gggagaataa 1380 

agaagggaga tcgagtatgg caaatcgcgt ttgggtcggg atttaagtgt aacagtgcgg 1440 

tgtggaaggc tctaaggaat gttaatccgg cggaagagaa aaatccttgg atggStgaga 1500 

ttcacctatt tccggttgag gttccattaa attaaaacct atcttcaagt tacaagttgt 1560 

tgttgttgtt tcattaggtt taataataag ctaatatgga aagcctttct actctctttt 1620 

ttttccactt ttttttttca atttcagagt tgggtcttag ttgtatcatc tacatgagtg 1680 

tattcgctat gcgctattcg ctattcgcta ttcactagtt aataaaatca aacgtccaaa 1740 

aaaaaaaaaa aaaaaaaaaa aaaaa 1765 
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